Software Topological Message Routing and Aggregation Techniques for Large Scale Parallel Systems

نویسنده

  • Lukasz Wesolowski
چکیده

Supercomputing networks are designed to minimize message latency. The focus on low latency implies a best effort attempt to deliver each message injected onto the network as soon as possible. Ostensibly, prioritizing message latency is important, and there are numerous examples of applications that benefit from it, but while this latency-centric view of a network seems logical, it often leads to unintended consequences. It may at first appear surprising that one would find fault with the idea of trying to deliver messages as soon as possible, but if we take a broader view of communication and consider modern world systems for delivery and transportation, we will find that the micro-management of the latency-centric approach is hardly the norm. As an example, consider an urban rail transit system. Traveling by train requires being aware of the schedule and sharing travel space with other passengers. The time to reach the destination may be higher on average than by car, as it involves waiting at the station and changing trains if the destination does not happen to lie along the route of the initially boarded train. Unless one is traveling in a densely populated area, a train may not even be an option. The various restrictions and limitations, which are a source of inconvenience for the passengers, also allow trains to function more efficiently. In many ways a public transit system is an example of a system that deemphasizes latency, that is each individual’s travel time, to improve bandwidth, the number of people transported to their destination per unit time. Compared to the bandwidth-centric approach of a rail system, roads and highways are decidedly latencycentric. Driving a car involves few of the limitations of a public transit system, and under normal conditions leads to significantly shorter travel times. Despite the higher convenience of cars, public transportation remains an indispensable piece of infrastructure in large cities. Allowing large numbers of people to travel using a minimal real estate footprint makes public transit systems a key prerequisite for supporting dense population concentrations of large cities, where office and living space are stacked vertically, while transportation is often limited to a fraction of the space along the two dimensions of the street level. If instead of using public transportation, everyone chose to drive to work in these areas, traffic congestion would increase travel times several fold, and other than the very earliest travelers, most people upon arrival at their destination would find that they have nowhere to park. As latency-optimized systems, supercomputing networks are designed like highways for the data traveling between the increasingly densely populated computing resources of a large system. This work explores the idea of reorganizing the network in software to act more like the public transportation system in order to significantly improve the efficiency of handling a high volume of communication. Toward this goal, we introduce the Topological Routing and Aggregation Module (TRAM), a software communication system for efficiently handling high volume communication patterns, particularly ones involving a high volume of fine-grained communication.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Topological Message Aggregation Techniques for Large - Scale Parallel Systems

High overhead of fine-grained communication is a significant performance bottleneck for many classes of applications written for large scale parallel systems. This thesis explores techniques for reducing this overhead through topological aggregation, in which fine-grained messages are dynamically combined not just at each source of communication, but also at intermediate points between source a...

متن کامل

Parallel computation framework for optimizing trailer routes in bulk transportation

We consider a rich tanker trailer routing problem with stochastic transit times for chemicals and liquid bulk orders. A typical route of the tanker trailer comprises of sourcing a cleaned and prepped trailer from a pre-wash location, pickup and delivery of chemical orders, cleaning the tanker trailer at a post-wash location after order delivery and prepping for the next order. Unlike traditiona...

متن کامل

Optimizing Message Aggregation for Parallel Simulation on High Performance Clusters

High performance clusters (HPCs) based on commodity hardware are becoming more and more popular in the parallel computing community. These new platforms offer a hardware capable of a very low latency and a very high throughput at an unbeatable cost, making them attractive for a large variety of parallel and distributed applications. With adequate communication software, HPCs have the potential ...

متن کامل

A partition-based algorithm for clustering large-scale software systems

Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawba...

متن کامل

A Hard Real-Time Static Task Allocation Methodology for Highly-Constrained Message-Passing Environments

This paper presents a method of exploiting the functional parallelism present within a class of well-defined deterministic software systems to achieve real-time execution on a highlyconstrained MIMD message-passing architectural model. The methodology is targeted to software systems that do not respond well to conventional data parallel techniques because of the irregular flow of data resulting...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014